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Abstract 

Additive asynchronous and cyclostationary impulsive noise limits communication performance in 
OFDM powerline communication (PLC) systems. Conventional OFDM receivers assume additive white 
Gaussian noise and hence experience degradation in communication performance in impulsive noise. 
Alternate designs assume a parametric statistical model of impulsive noise and use the model parameters 
in mitigating impulsive noise. These receivers require overhead in training and parameter estimation, 
and degrade due to model and parameter mismatch, especially in highly dynamic environments. In this 
paper, we model impulsive noise as a sparse vector in the time domain without any other assumptions, 
and apply sparse Bayesian learning methods for estimation and mitigation without training. We propose 
three iterative algorithms with different complexity vs. performance trade-offs: (1) we utilize the noise 
projection onto null and pilot tones to estimate and subtract the noise impulses; (2) we add the 
information in the data tones to perform joint noise estimation and OFDM detection; (3) we embed 
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our algorithm into a decision feedback structure to further enhance the performance of coded systems. 
When compared to conventional OFDM PLC receivers, the proposed receivers achieve SNR gains of 
up to 9 dB in coded and 10 dB in uncoded systems in the presence of impulsive noise. 

Index Terms 

Asynchronous impulsive noise, cyclostationary noise, PLC, OFDM, sparse Bayesian learning. 

I. Introduction 

Unlike traditional electric grids that carry one-way flow of power from generators to customers, 
smart grids use two-way flows of information to create an intelligent energy delivery network. 
Various technologies have emerged to facilitate data communications throughout the grid, es- 
pecially in two scenarios: outdoor communications between local utilities and customers, and 
indoor communications for home area networks. The purpose is to support smart grid applications 
such as smart metering and real-time energy management by the utility and the home users. 

Powerline communications (PLC) have been attractive as a solution for smart grid communi- 
cations. For outdoor communications, there has been a lot of interest in developing narrowband 
(NB) PLC systems that operate in the 3-500 kHz band to provide data rates up to 800 kbps 
(2j. Examples of NB PLC systems are specified in the industry developed standards PRIME [[3) 
and G3 J4), and recent international standards IEEE P1901.2 and ITU-T G.hnem. For indoor 
communications, broadband (BB) PLC systems that target higher data rates (up to 200 Mbps) 
utilizing the 1.8-250 MHz band have been standardized in IEEE P1901 and ITU-T G.hn. 

One of the major impairments in PLC systems is the additive non-Gaussian powerline noise. 
The noise can generally be decomposed into spectrally-shaped background noise, cyclostationary 
impulsive noise, and asynchronous impulsive noise [5]. The impulsive noise consists of short 
bursts of noise with power spectral density significantly higher than background noise. The 
ocurrence of noise bursts can be periodic or random, hence the terms "cyclostationary" and 
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"asynchronous", respectively. Recent field measurements on outdoor medium- voltage (MV) and 
low-voltage (LV) power lines have shown that cyclostationary impulsive noise synchronous to 
half the AC main's cycle is the dominant noise component in the 3-500 kHz band [6|. In addition, 
the ever-increasing deployment of heterogeneous PLC systems is leading to severe co-channel 
interference that is asynchronous impulsive noise in nature (7J. The asynchronous impulsive 
noise is also more significant in indoor BB PLC systems due to the switching transients of 
electrical appliances [[5J. 

Orthogonal frequency division multiplexing (OFDM) is adopted in most modern PLC stan- 
dards, primarily due to its advantages in combating frequency-selective channels and spectrally- 
shaped noise. The presence of noise bursts as much as 50 dB above background noise limits the 
overall channel capacity by decreasing the signal-to-noise ratio (SNR) in certain frequency band 
and time durations [|5j, [|6j. Various statistical properties of the noise can be utilized to improve 
the reliability and throughput of the communication system in impulsive noise environments. In 
particular, these statistics can be used to tune the performance of the transmitter and receiver to 
the specific noise scenario represented by those statistics [[8j-[10|. Although such an approach 



performs well in the specific noise scenario, it is inflexible and requires significant design effort. 
A simpler and effective approach is to denoise the communication signal before detection. In 
this approach, a noise estimation block would provide a noise estimate and subtract it from the 
received signal before passing it to the conventional OFDM receiver. 

In this work we aim to mitigate both asynchronous and cyclostationary impulsive noise 
in OFDM PLC receivers by proposing three denoising algorithms based on sparse Bayesian 
learning (SBL) techniques pT| . We first target the impulsive noise whose bursts are much 
shorter than an OFDM symbol (e.g. asynchronous impulsive noise). This noise is sparse in 
the time domain and can be estimated using SBL either by observing non-data tones such as 
nulls and pilots or by jointly decoding across all tones. Since the algorithms do not make any 
assumptions on the noise models and do not require extra training, we refer to them as "non- 



parametric" techniques. To deal with the impulsive noise with bursts spanning over one or more 
OFDM symbols (e.g. cyclostationary impulsive noise), we adopt a time-domain interleaving 



OFDM transceiver structure [12] where the transmitted and received signals are interleaved and 
deinterleaved, respectively, in the time domain across multiple OFDM symbols. The effect of 
the deinterleaver is to spread the long noise bursts into shorter ones. After deinterleaving, noise 
within each OFDM symbol becomes sparse in the time domain and we can apply the proposed 
denoising methods. 

The rest of the paper is organized as follows. In Section [II] we discuss the nature of both 
asynchronous and cyclostationary impulsive noise in PLC networks and the corresponding sta- 
tistical noise models. Although our proposed methods are noise model independent, these models 
are useful for simulating various impulsive noise scenarios to test our algorithms. We review 
existing receiver algorithms for impulsive noise mitigation in Section ITTTJ Having established the 



system model in Section IV we introduce the three non-parametric impulsive noise mitigation 
algorithms in Section fVl Then we perform complexity analysis and present a low-complexity 



implementation of the first proposed algorithm in Section VI To demonstrate the performance 



of our algorithms, simulation results are presented and discussed in Section VII 



II. Statistical Modeling of Impulsive Noise in PLC 

Noise in communication systems is typically modeled as a Gaussian random process. While 
an appropriate model for the thermal noise in the receiver circuitry, it fails to capture the 
characteristics of the noise and interference in PLC networks [|5J. Various power electronic 
devices, especially those with switching circuitry, inject random or periodic emissions of noise 
into the connected power lines. The resulting noise deviates from the Gaussian model and is 
impulsive in nature. Depending on the operating frequency range, we distinguish between two 
types of impulsive noise in PLC: the asynchronous and the cyclostationary impulse noise. 
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A. Asynchronous Impulsive Noise Modeling 

In PLC networks, asynchronous impulsive noise arises from random emission events occur- 
ring at residential and industrial sites. Typical interference sources include appliance switching 
transients and uncoordinated PLC transmissions due to co-existence issues. This type of noise 



is dominant in the higher frequency ranges from several hundred kHz to 20 MHz [13]. Various 
studies have empirically fitted the first order statistics of the noise to the Gaussian mixture, 
Middleton Class A, or Symmetric Alpha Stable (SaS) models p"4fl . Statistical-physical models 
of the asynchronous impulsive noise have been analytically derived based on PLC network 
topologies, and by modeling the random emissions as a temporal Poisson point process Q. 
The three scenarios analyzed in [jTJ are given in Fig. [T] and TABLE [I] along with the first order 
statistics of the asynchronous impulsive noise. 

For convenience of the discussion in later sections, we briefly describe the Gaussian mixture 
and the Middleton Class A models as follows. 

1 ) Gaussian Mixture Model: A random variable Z has a Gaussian mixture distribution if its 
probability density function (pdf) is a weighted summation of different Gaussian distributions 

K 

f z (z) = J2^-J^(z;0, lk ), (1) 

k=0 

where Af(z; 0, 7&) denotes a Gaussian pdf with zero mean and variance 7*,, and n k is the mixing 
probability of the A;-th Gaussian component. 



2) Middleton Class A Model: Middleton Class A model [15|, with an overlapping factor 
A G [10~ 2 , 1] and power ratio f2 6 [10~ 6 , 1], can be considered as a special case of the Gaussian 
mixture distribution, with ir k = e~ A j^r an ^ Ik = fc (^ n as K — > 00. In practice, only the first 
few significant terms are retained. 

B. Cyclo stationary Impulsive Noise Modeling 

While asynchronous impulsive noise dominates the higher frequency bands, recent studies 
showed that cyclostationary noise is the main impairment in the 3 — 500 kHz range ][2j, ][6j, 

5 



X Receiver ifr Interferer i , l Ji 1 1, l Interference Emissions 






= 7o 




ii.l. iii 


,u, i>,i 






■ * 




I " 




,u, lii 







M:s, A 3 ,':s 











LlJl 111 


,1.1, lil 










* 



I ill 1 1, 1 



Case I: Dominant 
Interference Source 



Case II: Homogenous PLC 
Network 



Case III: General PLC Network 



Fig. 1. Interference scenarios in PLC networks. Each interferer emits a random sequence of emissions onto the powe line, 
which add up at the receiver. Each interferer is described statistically by a mean number of emission events fi, mean duration 
between emission events A, and the pathloss to the receiver 7. 



Scenario 


Example Network 


Statistical Model 


Dominant Interferer 


Rural Area 
Industrial Area 


Middleton Class A: 

A = Xfj, , fi = AyE [h 2 B 2 ] /2 


Homogeneous PLC Network 


Urban Area 
Residential Buildings 


Middleton Class A: 
A = MX/i , fi = AjE [h 2 B 2 ] /2M 


General PLC Network 


Dense Urban Area 
Commercial 


Gaussian Mixture: 
7T fe and 7fc given in I7J 



TABLE I 



Statistical-physical models of interference in PLC networks categorized by network types. 
Parameters are given in Fig. Q] and M is the number of interferers. 



[16|. The cyclo stationary noise has periodically varying statistics with the period equal to half 
the AC main's cycle. A trace collected from a low voltage site is given in Fig. [2} 

The spectrogram in Fig. [2] indicates that the noise exhibits regions during each period where 
it appears stationary. This fact was exploited in [6| to propose a linear periodically time varying 
(LPTV) system model that divides noise samples in each period into M stationary regions 
{IZijfi^ Each region is characterized by a spectral shape and a corresponding linear time- 
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Fig. 2. The spectrogram of a noise trace at a low-voltage site. It highlights the cyclostationary nature of the noise both in time 
and frequency domain. 

invariant (LTI) spectrum shaping filter. Given this model, the cyclostationary noise n[k] can be 
generated as the convolution of an AWGN signal s[k] with an LPTV filter given by 

M 

n[k]=Y, h[k, r]s[r] = £ l ke1Zi ^ h,[k - r]s[r] (2) 

T 8=1 T 

where 1^ is the indicator function. 

III. Prior Work in Impulsive Noise Mitigation 
A. Asynchronous Impulsive Noise Mitigation 

Asynchronous impulsive noise has been widely seen not only in PLC but also in wireless 
communications systems. In both contexts, various parametric and non-parametric methods have 
been proposed to mitigate the effect of asynchronous impulsive noise in OFDM systems. 

In parametric methods, the receiver assumes a particular statistical noise model, and typically 
estimates the parameters of the statistical model during a training stage. For example, pre- 
filtering techniques based on the Alpha-Stable model have been proposed in | [i7) . While low 
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in complexity, their performance can deteriorate significantly as the constellation size increases. 
More promising results were reported in H, where two minimum mean-squared-error (MMSE) 
symbol-by-symbol estimators were derived assuming Gaussian mixture modeled impulsive noise. 



Additionally, in [ 18] an iterative decoding algorithm was introduced to cope with Middleton Class 
A impulsive noise. The advantage of parametric methods is that they lead to performance gains 
by exploiting information of the noise model and its parameters. However, they require extra 
training overhead and can suffer from performance degradation when the noise model and/or 
the parameters mismatch the possibly time varying statistics of the impulsive noise. 

Non-parametric methods, on the other hand, require no additional training or parameter 
estimation since they do not assume any particular noise model. This makes them more robust to 
various environments and more resilient to model mismatches. Taking advantage of the sparsity 
of the time-domain impulsive noise, [jT9j applied the compressed sensing (CS) techniques to 
estimate the impulsive noise from the null tones of the received signal. The algorithm was 
subject to a sufficient recovery condition that the number of impulses within an OFDM symbol 
does not exceed a threshold that is uniquely determined by the discrete Fourier transform (DFT) 
size and the number of null tones. However, for common OFDM system settings, the threshold 
turns out to be too restrictive for many impulsive noise environments where an OFDM symbol 
is subjected to multiple impulses during its duration. This CS-based approach was extended 
in p0| to a bursty impulsive noise detector that exploits the block- sparsity of the noise. The 
performance of the algorithm, however, is affected by parameters that should be ideally adapted 
to the number of noise bursts within an OFDM symbol, and the background noise level. 

Our work seeks to develop non-parametric asynchronous impulsive noise mitigation algorithms 
that are applicable to all impulsive noise scenarios. Towards this end, we extend the CS based 



algorithm in p9| to a sparse Bayesian learning (SBL) approach [ 1 1 1 for improved performance 
and robustness. 
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B. Cyclo stationary Impulsive Noise Mitigation 



Early research on cyclo stationary signal detection and extraction were primarily based on the 
theory of cumulant and cyclic spectrum [21]. In particular, the cyclic spectrum of a second-order 
cyclo stationary process contains harmonic peaks, which can be used for detection and extraction 
p2[ . Accurate estimation of the cyclic spectrum, however, generally require a large amount of 
data, which potentially limits its applications to real-time communications systems. 

Recent research on transmitter and receiver design has explicitly taken into account of the 
cyclo stationary noise. Some of the work attempted to optimize the transmitted signal and the 
receiver equalization under cyclo stationary noise [|9). In jT0|, a linear MMSE frequency domain 
equalizer (FDE) was derived assuming knowledge of the second-order noise statistics. Other 



work has been focused on cyclo stationary noise cancellation at the receiver. In p3| and [24|, it 



was demonstrated the use of linear adaptive predicton filters for cyclostationary noise estimation 



in NB PLC systems. More recently in [25|, it was suggested to parameterize the LPTV system 
model in Fig. [2] by a periodically switching autoregressive (AR) process, whose switching states 
and AR parameters can be inferred during training by Bayesian learning. Based on the estimated 
AR models, a periodically switching moving average (MA) filter can be prepended to the receiver 
for noise pre- whitening. These filter based algorithms, however, may suffer from the over-fitting 
problem especially in the simultaneous presence of the asynchronous impulsive noise. To improve 
the robustness against such outliers generally requires longer training sequences. 

Typical cyclostationary noise in PLC systems consists of high power noise bursts that sweep 
up to 30% of the period [6|, which is much longer than the duration of an OFDM symbol as 
specified in the standards. For example, the OFDM symbol duration in G3 is 231.7/zs in the 
US Federal Communications Commission (FCC) band, whereas 10% of the noise period (i.e. 
half the AC cycle) is 833.3/is in the US. In this situation, noise mitigation algroithms that span 
multiple OFDM symbols (e.g. a packet) are desirable. If the packet duration is much longer 



than the noise bursts, there is hope that we can recover the contaminated OFDM symbols using 
the information from the other impulsive noise free symbols. A time-domain block interleaved 
OFDM (TDI-OFDM) transceiver structure was proposed in JT2J to spread long noise bursts into 
shorter ones over multiple OFDM symbols. After deinterleaving at the receiver, each OFDM 
symbol is contaminated by a sparse noise vector in the time domain, similarly to the situation 
under asynchronous impulsive noise. Therefore the non-parametric techniques developed for 
asynchronous impulsive noise mitigation can also be applied to mitigate cyclo stationary noise. 

IV. System Model 

We first consider a conventional coded OFDM system whose complex baseband equivalent 
representation is shown in Fig. |3j At the transmitter, a binary data packet b is encoded into a 
codeword c. The codeword is then mapped to OFDM symbols, each with N — M data subcarriers 
and M non-data subcarriers. The non-data subcarriers are either null tones for spectral shaping 
and inter-carrier interference reduction, or pilots for channel estimation and synchronization. An 
OFDM symbol, denoted by x, is converted to the time domain by the inverse DFT (IDFT). 
A cyclic prefix (CP), assumed to be longer than the channel delay spread, is inserted to the 
beginning of each OFDM symbol to prevent inter-symbol interference (ISI). 

At the receiver, we remove the CP from the received OFDM symbols, resulting in 

r = HF*x + e + n, (3) 

where F is the iV-point DFT matrix, H e C NxN is the convolutional matrix of the channel 
and is circulant due to the cyclic prefix insertion, and e, n e represent impulsive noise and 
AWGN, respectively. The OFDM demodulator takes the DFT of r, leading to 

y = FHF*x + Fe + Fn 

= Ax + Fe + g. (4) 
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Fig. 3. A conventional baseband coded OFDM system. 
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Fig. 4. A time-domain interleaved OFDM system. The interleaver II interleaves the OFDM symbols at the sample level while 
the de-interleaver II - reverses the operation. The bursty impulsive noise only passes through II" 1 which spreads the pulses in 
the time domain and makes them appear sparse to the receiver. 



Here A = FHF* is a diagonal matrix, with {Hj} i=1 (the iV-point DFT coefficients of the 
channel impulse response) on its diagonal, and g = Fn is the DFT of n and is also AWGN 
since F is unitary. 

In the presence of bursty noise spannning multiple OFDM symbols, we adopt the time-domain 
block interleaved OFDM transceiver structure as in [ 12 1 (Fig. |4]). At the transmitter we interleave 



multiple OFDM symbols in the time domain at the sample level before CP insertion. At the 
receiver, after CP removal, we apply a frequency-domain channel equalizer (FEQ) to the received 
signal. The equalized signal is then deinterleaved in the time domain before converted to the 
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frequency domain by DFT. Assuming perfect channel estimation, the demodulated OFDM signal 
y can be expressed as 

y^x + Fe' + g'. (5) 

Here e' denotes the impulsive noise after deinterleaving, and g' is spectrally- shaped Gaussian 
noise due to the FEQ. We suppose that the interleaver size is large enough to spread the long 
noise bursts into short pulses over multiple OFDM symbols, so that e' is sparse. For simplicity, 
we also approximate g' as AWGN. With these assumptions, (|5]) can be considered as a special 
case of (|4]) with a flat channel (i.e. A = I). Therefore, the following derivations will apply to 
both @ and ©. 

Let X denote the index set of the null and pilot tones, where |X| — M < N. Also, let {-) x 
denote the sub-matrix (or sub-vector) corresponding to the rows (or elements) indexed by the 
set X. Assuming perfect channel estimation, i.e. complete knowledge of A, the impulsive noise 
can be observed from the null and pilot tones of the received OFDM symbol, since 

z = yi - (Ax)! 

= F x e + gx, 

g x ~ CAf(0,a 2 I M ). (6) 

The recovery of the length- N vector e from the noisy underdetermined Mx N linear system is 
generally an ill-conditioned problem. However, exploiting the sparse nature of e (since impulsive 
noise has very few non-zero samples in the time domain), we can possibly get an accurate 
estimate of e by applying various compressed sensing techniques. 

We would like to use the estimated impulsive noise to improve the detection of x. More 
specifically, the impulsive noise estimate e can be subtracted out from the received symbol on 
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the data tones to form a new decision metric 

= (Ax) T + g r + F r (e-e). (7) 

where (■) indicates set complement and thus X indicates the set of data tone indices. Assuming 
that e»e, the receiver can then proceed as if only Gaussian noise were present and apply the 
conventional detection and decoding algorithms. 

V. Non-parametric Impulsive Noise Estimation 

The estimation of impulsive noise converts to solving an underdetermined linear regression 
problem in ([6]) under sparsity constraints. Among various compressed sensing algorithms, sparse 
Bayesian learning (SBL) has recently obtained much attention since they generally achieve the 
best recovery performance. Therefore we apply the SBL techniques and propose three non- 
parametric algorithms for impulsive noise estimation. Before introducing the algorithms, we 
briefly describe the SBL framework. 



A. Sparse Bayesian Learning 



SBL was first proposed by Tipping J26|, and was introduced to sparse signal recovery by 



Wipf and Rao in [11 1. Generally, SBL is a Bayesian learning approach for solving the linear 
regression problem 



t = $w + v, v ~ CN(0, a 2 l 



M ) 



(8) 



N 



where t 6 C M is an observation vector, <fr = $ x 
(i.e. M < N), and w E C N is a sparse weight vector to be estimated. 
SBL imposes a parameterized Gaussian prior on w 

p(w;r)=CJV(w;0,r), 
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G C is an overcomplete basis 



(9) 



where T = diag{j}, and 7 £ 1Z N whose i-ih. component 7^ is the variance of lOj. Given the 
prior, the likelihood of the observation can be expressed as 



p(t; T, a 2 ) = CjV(t; 0, $T$* + a 2 I M ). (10) 



A maximum likelihood (ML) estimator solves the hyperparameters 7 and cr 2 that maximize ( fTO] ). 
The ML estimation is computed iteratively using expectation maximization (EM). 

Given the observations and the estimated hyperparameters, the posterior density of e is also 
a Gaussian distribution 

p(w|t;r,a 2 ) = CN(w;ijl w ,'E v> ), 

fJL w = CT~ 2 £™$*t, 

= (a-^Q + T- 1 )- 1 . (11) 

The maximum a posteriori (MAP) estimate of w is the posterior mean fi w . 

Due to the sparsity promoting property of the prior, upon convergence most components of 7 
and hence fjt w are driven to zero, rendering a sparse estimate of w. SBL has been proven to be 



more robust compared to the Basis Pursuit [27] and FOCUSS [28 ] algorithms, since the global 
optimum is always the sparsest solution, all local optimal solutions are sparse, and the number 
of local optima is the smallest. 



B. Estimation Using Null and Pilot Tones 

The SBL technique can be directly applied to the impulsive noise estimation using null and 
pilot tones, since substituting t = z, $ = Fx, w = e, and v = gi into ([8]) gives exactly (|6]). 

To obtain the ML estimates of the hyperparameters 7 and a 2 , we treat e as the latent variable 
and apply the EM algorithm. The update of the hyperparameters in the k-th iteration is as follows, 
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where we define 6 = (7, cr 2 ) for conciseness. 



7, 



(fc+i) 



argmax E. gW [log p(z, e; (k) )} 
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v (fc) -j- r« 
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(13) 
(14) 
(15) 



Upon termination of the EM algorithm, we obtain the MAP estimate of the time-domain 
impulsive noise e = fj, e . We then transform e to the frequency domain and subtract it from the 
received signal on the data tones according to @. 

C. Estimation Using All Tones 

As will be demonstrated in the simulation results, performance of the estimator using null 
and pilot tones is improved as the number of these non-data tones increases. However having 
fewer data tones means reduced throughput. When the number of non-data tones is limited, it 
is desirable to exploit information available in all tones to estimate the impulsive noise. To do 
this, we define u = Ax + g, and rewrite as 



z 








= Fe + 











u x ~ CN \0,a 2 I 



M 



-Mj- 
IS 



(16) 



Imposing the parameterized Gaussian prior on e, i.e. p(e; T) = CAf(0, T), the likelihood of z 
remains the same as pO] ), while the likelihood of yj, the received signal on the data tones, is 

p(y r ,(\x) T ,T,a 2 ) = CAf(y T ;(Ax) T ,V y -), 

S y _ = F T TF* T + a 2 l N ^ M} (17) 

with the unknown transmitted signal as a third hyperparameter in addition to V and a 2 . 
Although consists of constellation points, which are discrete, we temporarily relax it to be 
continuous when treated as a hyperparameter in the EM algorithm. 

The iterative updates in the EM algorithm now involve all three hyperparameters. Since each 
hyperparameter is updated while keeping the others fixed, the update equations for 7 and a 2 are 



in the same forms as ( fT2| ) and ( fT3| ). We treat (Aa?)j in a whole as a hyperparameter and update 
it as 

(Ax) { T k+1) = argmaxE e|y . 0(fe) [log p(y,e;0^)} 



(Aa>) 



x 



= argmin |yj — (Ax)j — Fjfi 

(Ax) T 

= y T -F Tf iJ k \ (18) 
The entire EM algorithm is summarized as follows. 

7 ? +1) = argmaxE, y . w[log p(y,e;0^)} 

7l >o |y ' 

= S e « + ( Me f) 2 , (19) 



l(||y-Ax( fc )-F^)|| 2 + 

AT 

(^^Ell-lf)- 1 ^, (20) 
i=i 

(Ax)f +1) = yr-F T eW, (21) 
Se (*) = r W _ r (fe) F * E -i Fr (fe) j ( 22) 

e (k) = ^ = -^^F*(y-A^). (23) 
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D. Decision Feedback Estimation 

The two estimators described above impose a parameterized Gaussian prior on the time- 
domain impulsive noise, i.e. e ~ CAf(0,T). Prior information on T, or equivalently on the 
precision matrix P = r~ , can be introduced by the conjugate prior distribution on these 
hyperparameters. Let p = [pi, • • • ,Pn] t denote the diagonal of P. The conjugate prior on 
p is a Gamma distribution 

N 

P(p;a,b) = JjGafo; ai,&i). (24) 
i=i 

where Ga(-;a, b) is the Gamma distribution with parameters a and b. When cij = 0,6j = 0,Vi, 
<[24j) reduces to a uniform distribution, which is an uninformative prior that is implicitly im- 
posed in the previously described SBL framework. Non-zero values of di and &j contain prior 



information that is integrated into the likelihood function in ( [TO] ), resulting in 



p(z;P,a 2 ,a,b) = CAf(z; 0, F X P X FJ + a 2 I 



M) 



Ga(P;a,b). (25) 



Differentiating (25) with respect to log p if and setting the derivatives to zeros, we obtain 



P * = 1 * = M.J + + 26, ■ (26) 



Comparing (26) to (12), we can see the prior information contained in a$ and 6j does affect the 



ML estimates of 7. Since ( |26| ) is the conjugate prior on p, the posterior probability of p given 
e, a and b is also Gamma distributed, i.e. 

N 

P(p|e; a, b) = JJ Gafe; a i; b { ) (27) 



with the updated parameters 



1 

6, = &, + ^- (28) 
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Fig. 5. A decision feedback impulsive noise estimator. 



Suppose that in addition to the MAP estimate e given by the estimator using non-data tones, 
a second estimate of e, denoted by e, is available based on certain side information. The side 
information contained in e can be fused into e via the posterior distribution of p given e. More 



specifically, given e, we update a and b according to (28), and then solve the ML estimate of 



p ( |26| ) with the updated a and b. 

In coded OFDM systems, the redundancy in the coded data tones can be exploited as the 
side information to provide a second estimate of e. More specifically, the decoder takes the 
OFDM symbols after impulsive noise mitigation as the input, and produces hard decisions on 
the uncoded and coded bits, b and c, respectively. Using c we can recover the data tones of the 
OFDM symbols by appropriate constellation mapping. This gives an estimate of xj, which is 
multiplied by the channel frequency response A, transformed to the time domain and subtracted 
from the received signal r to generate the estimate e. Then we use e to update a and b, through 
which the information extracted from the coding redundancy is transferred back to the impulsive 
noise estimator. As such, we form a decision feedback estimator that transfers information back- 
and-forth between the impulsive noise estimator using non-data tones and the decoder using data 



tones (Fig. [5]). Compared to the estimator using all tones in Section V-C the decision feedback 



estimator is expected to have better performance by exploiting the redundant information (i.e. 
coding structure) on the data tones. 
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VI. Low-Complexity Implementation 

The core SBL algorithm in ( flT) involves an M x M matrix inversion (M is the number of null 
and pilot tones) which might be practically infeasible on a hardware platform. An accelerated 
version of SBL that utilizes the properties of the marginal likelihood has been proposed in 



|29|. Using these properties, the accelerated SBL algorithm performs a sequential addition and 
deletion of candidate basis functions given by the columns of <3? in ([8]), while keeping the same 
reconstruction performance p9|. 



Algorithm 1 Sequential SBL algorithm |29| 



l: Initialize background noise variance a 

1 1 2 

2: Select f with largest projection ||foy|| on the observed vector y 

3: Compute «o = (||foy|| 2 — cr 2 ) \ all other acj = oo (exclude from model) 

4: Compute £ and fi as given by (29j 

5: while A<x, < threshold , Vj do 

6: Select a candidate basis fj from columns of F 

7: Compute 9i = \\qi\\ — Sj 

8: if 6,; > and as < oo then 



9: fj is in model, re-estimate a, as given in [29] 

10: else if 9i > and ati = oo then 
11: fj not in model, add fj and update parameters 

12: else 

13: fj is in model, remove fj and update parameters 

14: end if 

15: Update noise variance a 2 as given in [29 1 
16: end while 
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In the context of impulsive noise estimation, an impulse at noise sample i is represented as 
a non-zero entry in vector e at index i. As a result, it contributes e^f, to the observed output 
y, where we denote the i-th column in Fx as £j. The accelerated algorithm will sequentially 
add, remove, or update basis fj until convergence. At convergence, the basis that remain in the 
model will indicate the support of vector e and thereby the locations of the impulses. Algorithm 
[T] presents a high-level description of the sequential algorithm. Due to the space constraints, we 
refer interested readers to p9| for the mathematical details. 

Significant computational savings can be obtained if the background noise power a 1 is known. 
This will allow for efficient calculations in steps |9} TTj 13 in algorithm [T] without any matrix 
inversions as given in [29|. TABLE [n] compares the complexity per iteration of the original 
SBL-based algorithms and the sequential implementation of it. The computational complexity of 



the original algorithms is dominated by the matrix multiplication and inversion operations in ( 13 ) 
and ( |2"0"| ). Compared to the estimator using non-data tones, the estimator using all tones increases 
the complexity from 0(N 2 M) per iteration to 0(N 3 ) per iteration, where N is the DFT size. 



Estimator 


Operation 


Complexity 


Using null and pilot Tones 


Matrix multiply 


0(N 2 M) 


Matrix inversion 


0(M 3 ) 


Using all tones 


Matrix multiply 


0(N 3 ) 


Matrix inversion 


0(N 3 ) 


Sequential SBL 
w/ unknown background noise power 


Matrix multiply 


0(N 2 K) 


Matrix inversion 


0{K Z ) 


Sequential SBL 
w/ known background noise power 


Matrix multiply 


0(N 2 K) 



TABLE II 



Complexity per iteration of the proposed algorithms. TV is the FFT size, M is the number of null and 

PILOT TONES, AND K IS THE NUMBER OF MODEL BASIS IN THE CURRENT ITERATION. 
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Parameters 


Simulation 


G3 Standard 


FFT Length 


128 


256 


Modulation 


QPSK 


DPSK (|C|=2,4,8) 


# of Tones 


128 


128 


# of Data Tones 


72 (#33 - #104) 


72 (#33 - #104) 


# of Null Tones 


56 


56 


Forward Error 
Correction Code 


Rate- 1/2 Convolutional 


Rate- 1/2 Convolutional 
+ Reed Solomon 


Interleaver 


Random interleaver 
in time domain 


s-interleaver 
In frequency domain 


Interleave Size 


100 OFDM symbols 


Entire packet 



TABLE III 

Parameters of the simulated OFDM system and the G3 standard. \C\ denotes the constellation size. 



On the other hand, each iteration of the sequential SBL involves matrix multiplications and 
inversions that have complexities of 0(N 2 K) and 0(K 3 ), respectively, where K is the number 
of model basis in that particular iteration. Furthermore, it can make use of the knowledge of 
background noise power to eliminate any matrix inversion operations. 

VII. Simulation Results 

To quantify the performance of our proposed algorithms, we simulate a baseband OFDM 
system over a flat channel. Some parameters of the OFDM system are listed and compared to 
the G3 PLC standard in Table HID 

We generate the asynchronous impulsive noise from two different statistical models: a 3- 
component Gaussian mixture (GM) distribution with n = [0.9, 0.07, 0.03] and 7 = [1, 100, 1000], 
and a Middleton Class A (MCA) distribution with A — 0.1, Q — 0.01, and the probability density 
function truncated to the first 10 mixture components. The noise samples are assumed to be i.i.d. 
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In addition, we synthesize cyclostationary noise using the LPTV model proposed in [6|, with 
the filter spectrums as in Fig. [2] 

We compare our proposed algorithms with the non-parametric CS and least squares based 
algorithm in [[T9j (denoted as CS+LS in the following). The two parametric MMSE detectors 
in [[8j are also implemented to provide baseline references in the asynchronous impulsive noise. 
Both MMSE detectors assume perfect knowledge of the Gaussian mixture impulsive noise model, 
and one even assumes perfect knowledge of the noise variance at each time instance (i.e. noise 
state information, or NSI). The algorithms are simulated with the convolutional code switched on 
and off. We measure the bit error rates (BER) over 10 4 OFDM symbols, which contain 1.44 x 10 6 
and 7.2 x 10 5 bits in uncoded and coded systems, respectively. The BER performance of the 
algorithms in uncoded and coded systems in different impulsive noise scenarios are plotted in 
Fig. § and |7j 

In the uncoded system, our proposed estimator using null tones achieves 6-8 dB SNR gain 
over conventional OFDM receivers in the asynchronous impulsive noise. We have got additional 
1-2 dB gain in a relatively wide SNR region by using all tones. We notice a marginal performance 



System 


Noise 


SBL 
w/ Null Tones 


SBL 
w/ All Tones 


SBL 
w/ Feedback 


Uncoded 


GM 


8 dB 


10 dB 




MCA 


6 dB 


7 dB 




Coded 


GM 


2 dB 


7 dB 


9 dB 


MCA 


1.75 dB 


6.75 dB 


8.75 dB 


Cyclic 


2 dB 


5 dB 


9.25 dB 



TABLE IV 

SNR GAINS OF THE PROPOSED IMPULSIVE NOISE MITIGATION ALGORITHMS OVER THE CONVENTIONAL OFDM SYSTEM 

WITHOUT DENOISING. 
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Fig. 6. Uncoded (top row) and coded (bottom row) BER performance of proposed algorithms in Gaussian mixture (left 
column) and Middleton Class A (right column) modeled asynchronous impulsive noise, in comparison with the conventional 
OFDM system without denoising, the compressed sensing based algorithm (19) , and two parametric MMSE estimators (HJ. 



loss of the estimator using all tones in low SNR regimes. This is because at lower SNRs, the errors 



introduced by the continuous relaxation of constellation points x (see Section V-C ) become more 
significant. Compared to the parametric MMSE estimators, our proposed estimators outperform 
the MMSE estimator without NSI in moderate and high SNR regimes. 

In the coded system, the proposed estimator using null tones can achieve up to 10 dB SNR 
gain over conventional OFDM receivers in the asynchronous impulsive noise. The estimator 

23 



5 — ©— No cancellation 

-A-SBLw/nulltones 

-V-SBL w/ all tones 
. —0— Decision feedback SBL 

— b 1 1 | | | ' | ! | | 

-6 -4 -2 2 4 6 8 10 
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Fig. 7. Coded BER performance of proposed algorithms in cyclostationary impulsive noise. 



using all tones provides an additional 2-5 dB gains. Furthermore, using decision feedback from 
the convolutional decoder, we obtain an extra 2 dB gain. In the cyclostationary impulsive noise, 
we achieve around 5 dB SNR gain with the estimator using null tones, and an additional 2-4 dB 
gains by using all tones. The decision feedback estimator in this case has an extra 4 dB gain 
over the one using all tones. 

For clarity purposes, we measured the approximate SNR gains of the proposed algorithms 
over the conventional OFDM system without denoising at a target BER of 1CT 4 , as listed in 
TABLE [IVJ 

In Fig. [6| we can see that the compressed sensing based (CS+LS) algorithm performs worse 



than our proposed estimators in both noise scenarios. As mentioned previously in Section III 
this is because the CS algorithm can only recover the impulsive noise with high sparsity, i.e 
typically less than 5 impulses per OFDM symbol in our system settings. 
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VIII. Conclusion 



This paper proposes three methods for improving communication performance of OFDM PLC 
systems in the presence of asynchronous impulsive noise and cyclo stationary impulsive noise. 
To mitigate asynchronous impulsive noise, we apply sparse Bayesian learning (SBL) techniques 
to estimate the impulsive noise from the received signal by observing information either on the 
null and pilot subcarriers or on all subcarriers. Under cyclo stationary noise, we adopt a time- 
domain interleaving OFDM transceiver structure to break long noise bursts that span multiple 
OFDM symbols into short bursts, and then apply the SBL techniques. All the methods are non- 
parametric, i.e. do not require prior knowledge on the statistical noise model or model parameters. 
We validate the proposed algorithms based on Gaussian mixture and Middleton Class A modeled 
asynchronous impulsive noise, and the cyclo stationary impulsive noise. 
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